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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/269,118 



AU 1814 



DATE: 11/17/94 
TIME: 16:35:04 



INPUT SET: SI 134. raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 
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SEQUENCE LISTING 



(1) 



General Information: 



(i) APPLICANT: Inouye, Sumiko 
Hsu, Mei-Yin 
Eagle, Susan 
Inouye, Masayori 

(ii) TITLE OF INVENTION: Prokaryotic Reverse Transcriptase 
(iii) NUMBER OF SEQUENCES: 42 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Weiser & Associates 

(B) STREET: 230 South Fifteenth Street, Suite 500 

(C) CITY: Philadelphia 

(D) STATE: Pennsylvania 

(E) COUNTRY: U.S.A. 

(F) ZIP: 19102 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
<B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/269,118 

(B) FILING DATE: 10-JUN-1994 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Weiser, Gerard J. 

(B) REGISTRATION NUMBER: 19,76 3 

(C) REFERENCE/ DOCKET NUMBER: 377.5888P 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 215-875-8383 

(B) TELEFAX: 215-875-8394 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2176 base pairs 
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RAW SEQUENCE LISTING DATE: 1 1/17/94 

PATENT APPLICATION US/08/269,118 TIME: 16:35:07 

INPUT SET: Sim.raw 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 640.. 2094 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

TCATCCGCGC GGACACCCCC TCCTACGTGC CCCCCGACGC GGAGAGCGGC GTGGAGACGG 60 

TGTACCGCGT TTCCCTGGAT GGTCACCTGG TGGCGGTGGA GTGGGGCCCG CGCACGGGCT 120 

CGCCGCGTCA CCAGCGGCTC TGGTTCGACT CGGATGCGGA AGCCCCCGGA GCCTACTTCG 180 

CGCGCCTCGA GAAGTTGGCG GCTGACGGCT ACATCGACGC GGCCTCGGCA TTGGTCTAAA 240 

CCCTTCAACC ACGGCTCGGC CGCCACGCGC GGCCGGCAGG ACAGGTGCGA CGAACAGACG 300 

ACGACGTGCG CTTCACGCGC GAGCAGCCGA GAGAGGTCCG GAGTGCATCA GCCTGAGCGC 360 

CTCGAGCGGC GGAGCGGCGT TGCGCCGCTC CGGTTGGAAT GCAGGACACT CTCCGCAAGG 420 

TAGCCTGTTC TTGGCTCTCT CCCTCCTAGG CACTACGGCC AGGGTGGGTA GCGGAGCCAA 480 

CGACGCCACC GCCGTTTACC CACCCCGGCC GTAGTGCCTA GGAGGGGAGA GCCGGTGAGG 540 

CTACCGTGCC CCAGGTAAGA TGGTGGTGCT TTCCCGGCCT CCGTCGACTG CTCGCGCCAT 600 

GTCCCGTCTT CCATCGCCGC GCCCGCCCAA GGTGCAGAC ATG ACC GCC AGG CTG 654 

Met Thr Ala Arg Leu 
1 5 

GAC CCG TTC GTC CCC GCA GCT TCG CCG CAG GCC GTG CCC ACG CCC GAG 702 
Asp Pro Phe Val Pro Ala Ala Ser Pro Gin Ala Val Pro Thr Pro Glu 
10 15 20 

CTC ACC GCT CCG TCG TCA GAC GCG GCC GCG AAG CGT GAA GCC CGC CGG 750 
Leu Thr Ala Pro Ser Ser Asp Ala Ala Ala Lys Arg Glu Ala Arg Arg 
25 30 35 

CTC GCG CAC GAA GCG TTG CTC GTC CGC GCG AAG GCC ATC GAC GAA GCG 798 
Leu Ala His Glu Ala Leu Leu Val Arg Ala Lys Ala lie Asp Glu Ala 
40 45 50 

GGC GGC GCC GAC GAC TGG GTG CAG GCG CAG CTC GTC TCC AAG GGG CTC 846 
Gly Gly Ala Asp Asp Trp Val Gin Ala Gin Leu Val Ser Lys Gly Leu 
55 60 65 

GCG GTC GAG GAC CTG GAC TTC TCC AGC GCC TCC GAG AAG GAC AAG AAG 894 



• 



PAGE: 3 RAW SEQUENCE LISTING DATE: 1 1/17/94 

PATENT APPLICATION US/08/269,118 TIME: 16:35:09 

INPUT SET: SU34.mw 

100 Ala Val Glu Asp Leu Asp Phe Ser Ser Ala Ser Glu Lys Asp Lys Lys 

101 70 75 80 85 
102 

103 GCC TGG AAG GAG AAG AAG AAG GCC GAG GCC ACC GAG CGC CGC GCG CTG 942 

104 Ala Trp Lys Glu Lys Lys Lys Ala Glu Ala Thr Glu Arg Arg Ala Leu 

105 90 95 100 
106 

107 AAG CGT CAG GCG CAC GAG GCG TGG AAG GCC ACG CAC GTG GGC CAC CTG 990 

108 Lys Arg Gin Ala His Glu Ala Trp Lys Ala Thr His Val Gly His Leu 

109 105 110 115 
110 

111 GGC GCG GGC GTG CAC TGG GCG GAG GAC CGC CTG GCC GAC GCG TTC GAC 1038 

112 Gly Ala Gly Val His Trp Ala Glu Asp Arg Leu Ala Asp Ala Phe Asp 

113 120 125 130 
114 

115 GTG CCC CAC CGC GAG GAG CGC GCC CGG GCC AAC GGC CTG ACG GAG CTG 1086 

116 Val Pro His Arg Glu Glu Arg Ala Arg Ala Asn Gly Leu Thr Glu Leu 

117 135 140 145 
118 

119 GAC TCC GCG GAG GCG CTG GCC AAG GCG CTG GGG CTG AGC GTC TCC AAG 1134 

120 Asp Ser Ala Glu Ala Leu Ala Lys Ala Leu Gly Leu Ser Val Ser Lys 

121 150 155 160 165 
122 

123 CTC CGC TGG TTC GCG TTC CAC CGG GAG GTC GAC ACG GCC ACG CAC TAC 1182 

124 Leu Arg Trp Phe Ala Phe His Arg Glu Val Asp Thr Ala Thr His Tyr 

125 170 175 180 
126 

127 GTG AGC TGG ACC ATT CCG AAG CGG GAC GGC AGC AAG CGC ACG ATT ACG 1230 

128 Val Ser Trp Thr lie Pro Lys Arg Asp Gly Ser Lys Arg Thr lie Thr 

129 185 190 195 
130 

131 TCC CCC AAG CCT GAG CTG AAG GCA GCG CAG CGC TGG GTG CTG TCC AAC 1278 

132 Ser Pro Lys Pro Glu Leu Lys Ala Ala Gin Arg Trp Val Leu Ser Asn 

133 200 205 210 
134 

135 GTC GTG GAG CGG CTG CCG GTC CAC GGC GCC GCC CAC GGC TTC GTG GCG 1326 

136 Val Val Glu Arg Leu Pro Val His Gly Ala Ala His Gly Phe Val Ala 

137 215 220 225 
138 

139 GGA CGC TCC ATC CTC ACC AAC GCG CTG GCC CAC CAG GGC GCG GAC GTC 1374 

140 Gly Arg Ser lie Leu Thr Asn Ala Leu Ala His Gin Gly Ala Asp Val 

141 230 235 240 245 
142 

14 3 GTG GTC AAG GTG GAC CTC AAG GAC TTC TTC CCC TCC GTC ACC TGG CGC 1422 

144 Val Val Lys Val Asp Leu Lys Asp Phe Phe Pro Ser Val Thr Trp Arg 

145 250 255 260 
146 

147 CGG GTG AAG GGC CTG TTG CGC AAG GGC GGC CTG CGG GAG GGC ACG TCC 1470 

148 Arg Val Lys Gly Leu Leu Arg Lys Gly Gly Leu Arg Glu Gly Thr Ser 

149 265 270 275 
150 

151 ACG CTG CTG TCC CTC CTC TCC ACG GAA GCG CCG CGG GAG GCG GTC CAG 1518 

152 Thr Leu Leu Ser Leu Leu Ser Thr Glu Ala Pro Arg Glu Ala Val Gin 



PAGE: 4 RAW SEQUENCE LISTING DATE* 1 1/17/94 

PATENT APPLICATION US/08/269,118 TIME: 16:35:12 

INPUT SET: S1134.raw 

153 280 285 290 

154 

155 TTC CGC GGC AAG CTC CTG CAC GTC GCC AAG GGC CCG CGC GCC CTG CCC 1566 

156 Phe Arg Gly Lys Leu Leu His Val Ala Lys Gly Pro Arg Ala Leu Pro 

157 295 300 305 
158 

15 9 CAG GGC GCC CCC ACG TCG CCC GGC ATC ACC AAC GCG CTC TGC CTG AAG 1614 

160 Gin Gly Ala Pro Thr Ser Pro Gly lie Thr Asn Ala Leu Cys Leu Lys 

161 310 315 320 325 
162 

16 3 CTC GAC AAG CGG CTG TCC GCC CTC GCG AAG CGG CTG GGC TTC ACC TAC 1662 

164 Leu Asp Lys Arg Leu Ser Ala Leu Ala Lys Arg Leu Gly Phe Thr Tyr 

165 330 335 340 
166 

16 7 ACG CGC TAC GCG GAC GAC CTG ACC TTC TCC TGG ACG AAG GCG AAG CAG 1710 

168 Thr Arg Tyr Ala Asp Asp Leu Thr Phe Ser Trp Thr Lys Ala Lys Gin 

169 345 350 355 
170 

171 CCC AAG CCG CGG CGG ACG CAG CGT CCC CCC GTC GCG GTC CTC CTG TCT 1758 

172 Pro Lys Pro Arg Arg Thr Gin Arg Pro Pro Val Ala Val Leu Leu Ser 

173 360 365 370 
174 

175 CGC GTC CAG GAA GTG GTG GAG GCG GAG GGC TTC CGC GTG CAC CCG GAC 1806 

176 Arg Val Gin Glu Val Val Glu Ala Glu Gly Phe Arg Val His Pro Asp 

177 375 380 385 
178 

17 9 AAG ACG CGC GTC GCC CGC AAG GGC ACG CGG CAG CGG GTC ACC GGG CTC 1854 

180 Lys Thr Arg Val Ala Arg Lys Gly Thr Arg Gin Arg Val Thr Gly Leu 

181 390 395 400 405 
182 

18 3 GTC GTG AAT GCG GCG GGC AAG GAC GCG CCC GCG GCC CGA GTC CCG CGC 1902 

184 Val Val Asn Ala Ala Gly Lys Asp Ala Pro Ala Ala Arg Val Pro Arg 

185 410 415 420 
186 

187 GAC GTC GTC CGC CAG CTC CGC GCC GCC ATC CAC AAC CGG AAG AAG GGC 1950 

188 Asp Val Val Arg Gin Leu Arg Ala Ala lie His Asn Arg Lys Lys Gly 

189 425 430 435 
190 

191 AAG CCG GGC CGC GAG GGC GAG TCG CTC GAG CAG CTC AAG GGC ATG GCC 1998 

192 Lys Pro Gly Arg Glu Gly Glu Ser Leu Glu Gin Leu Lys Gly Met Ala 

193 440 445 450 
194 

195 GCC TTC ATC CAC ATG ACG GAC CCG GCC AAG GGC CGC GCC TTC CTG GCT 2046 

196 Ala Phe lie His Met Thr Asp Pro Ala Lys Gly Arg Ala Phe Leu Ala 

197 455 460 465 
198 

199 CAG CTC ACG GAG CTC GAG TCC ACG GCG AGC GCC GCT CCG CAG GCG GAG 2094 

200 Gin Leu Thr Glu Leu Glu Ser Thr Ala Ser Ala Ala Pro Gin Ala Glu 

201 470 475 480 485 
202 

203 TGACGCTCAG CGCGCGTCCG TCGCCGACGT GCCGCGCGCC AGCAACGCCG CATTCAGCAA 2154 
204 

205 CTCCGTCAGC CGGCGCGGGT AC 2176 
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RAW SEQUENCE LISTING date l i/n/94 

PATENT APPLICATION US/08/269,118 TIME: 16:35:15 

INPUT SET: Sim.raw 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val Lys Gin Trp Pro 
15 10 15 

Leu Thr Glu Glu Lys lie Lys Ala Leu Val Glu lie Cys Thr Glu Met 
20 25 30 

Glu Lys Glu Gly Lys lie Ser Lys lie Gly Pro Glu Asn Pro Tyr Asn 
35 40 45 

Thr Pro Val Phe Ala lie Lys Lys Lys Asp Ser Thr Lys Trp Arg Lys 
50 55 60 

Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gin Asp Phe Trp Glu 
65 70 75 80 

Val Gin Leu Gly lie Pro His Pro Ala Gly Leu Lys Lys Lys Lys Ser 
85 90 95 

Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser Val Pro Leu Asp 
100 105 HO 

Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr He Pro Ser He Asn Asn 
115 120 125 

Glu Thr Pro Gly He Arg Tyr Gin Tyr Asn Val Leu Pro Gin Gly Trp 
130 135 140 

Lys Gly Ser Pro Ala He Phe Gin Ser Ser Met Thr Lys He Leu Glu 
145 150 155 160 

Pro Phe Lys Lys Gin Asn Pro Asp He Val He Tyr Gin Tyr Met Asp 
165 170 175 

Asp Leu Tyr Val Gly Ser Asp Leu Glu He Gly Gin His Arg Thr Lys 
180 185 190 

He Glu Glu Leu Arg Gin His Leu Leu Arg Trp Gly Leu Thr Thr Pro 
195 200 205 



PAGE: l SEQUENCE VERIFICATION REPORT DATE: 1 1/17/94 

PATENT APPLICATION US/08/269,118 TIME: 16:35:18 

INPUT SET: S1134.mw 



Line Error 



Original Text 



